A Comparison of Techniques for Automatic Clustering of Handwritten Characters

Authors

  • Vuokko Vuori
  • Jorma Laaksonen
Abstract

This work reports experiments with four hierarchical clustering algorithms and two clustering indices applied to online handwritten characters. The main motivation is to develop an automatic method for finding a set of prototypical characters that represents well the different writing styles present in a large international database. One of the major obstacles to achieving this goal is the uneven representation of different writing styles in the database. On the basis of the experimental results, we claim that a good set of prototypes can be formed from the combined results of the different clustering algorithms. However, the number of clusters cannot be determined automatically; some human intervention is required.
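The general idea of hierarchical clustering followed by prototype selection can be illustrated with a minimal sketch. The abstract does not name the four algorithms or the two indices, so everything here is an assumption made for illustration: the three linkage rules (single, complete, average), the Euclidean distance on 2-D points standing in for a dissimilarity between handwritten characters, the use of a cluster medoid as the prototype, and the function names `agglomerate` and `medoid`. The number of clusters is passed in by hand, in line with the abstract's remark that it cannot be determined automatically.

```python
from itertools import combinations
from math import dist  # Euclidean distance, Python 3.8+


def agglomerate(points, n_clusters, linkage="average"):
    """Bottom-up clustering: merge the two closest clusters until n_clusters remain.

    Returns a list of clusters, each a sorted list of indices into `points`.
    The linkage rules below are illustrative, not the ones used in the paper.
    """
    reducers = {
        "single": min,                             # closest pair of members
        "complete": max,                           # farthest pair of members
        "average": lambda ds: sum(ds) / len(ds),   # mean over all member pairs
    }
    reduce_fn = reducers[linkage]
    clusters = [[i] for i in range(len(points))]

    def cluster_distance(a, b):
        return reduce_fn([dist(points[i], points[j]) for i in a for j in b])

    while len(clusters) > n_clusters:
        # Pick the pair of clusters with the smallest linkage distance.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda pair: cluster_distance(clusters[pair[0]], clusters[pair[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # a < b, so index a is still valid
    return [sorted(c) for c in clusters]


def medoid(points, members):
    """Index of the member with the smallest total distance to the other members."""
    return min(members, key=lambda i: sum(dist(points[i], points[j]) for j in members))


if __name__ == "__main__":
    # Toy data: two well-separated groups standing in for two writing styles.
    samples = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    for method in ("single", "complete", "average"):
        groups = agglomerate(samples, n_clusters=2, linkage=method)
        prototypes = [medoid(samples, g) for g in groups]
        print(method, groups, prototypes)
```

On this toy input the three linkage rules agree on the same two groups. On real data they often disagree, which is one reason a comparison or combination of several algorithms, as the abstract describes, can be informative.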

Similar resources

Experiments with a Self-supervised Adaptive Classification Strategy in On-line Recognition of Isolated Handwritten Latin Characters

Results of a comparison of recognition techniques for on-line recognition of handwritten Latin alphabets are presented. The emphasis is on an adaptive classification strategy introduced in this paper. The classification strategy is based on compressing or distilling a large database of handwritten characters to a small set of character prototypes. The distillation is performed as a clustering pr...

Persian Handwritten Digit Recognition Using Particle Swarm Probabilistic Neural Network

Handwritten digit recognition can be categorized as a classification problem. The Probabilistic Neural Network (PNN) is one of the most effective and useful classifiers, and it is based on the Bayesian rule. In this paper, a combination of an intelligent clustering method and a PNN is used to recognize Persian (Farsi) handwritten digits. Hoda database, which includes 80000 P...

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

To facilitate the entry of data into the computer and its digitization, automatic recognition of printed texts and manuscripts is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

Implementation of Feed-forward Neural Network Models for Pattern Classification Using Transformation Based Feature Extraction Methods

Automatic recognition of handwritten Hindi characters is a difficult and highly interesting research area in the field of pattern recognition. A lot of work has been done in this area to date, yet it remains a subject of active research. Hindi characters are cursive in nature and may therefore be written in various cursive ways. Characters also show a lot of similar features such as hea...

Online Recognition of Handwritten Korean and English Characters

In this study, an improved HMM-based recognition model is proposed for online English and Korean handwritten characters. The pattern elements of the handwriting model are sub-character strokes and ligatures. To deal with the problem of handwriting style variations, a modified hierarchical clustering approach is introduced to partition different writing styles into several classes. For each of t...

Publication date: 2002